Probabilistic Disease Classi cation of Expression-Dependent Proteomic Data from Mass Spectrometry of Human Serum
نویسندگان
چکیده
We have developed an algorithm called Q5 for probabilistic classi cation of healthy versus disease whole serum samples using mass spectrometry. The algorithm employs principal components analysis (PCA) followed by linear discriminant analysis (LDA) on whole spectrum surface-enhanced laser desorption/ionization time of ight (SELDI-TOF) mass spectrometry (MS) data and is demonstrated on four real datasets from complete, complex SELDI spectra of human blood serum. Q5 is a closed-form, exact solution to the problem of classi cation of complete mass spectra of a complex protein mixture. Q5 employs a probabilistic classi cation algorithm built upon a dimension-reduced linear discriminant analysis. Our solution is computationally ef cient; it is noniterative and computes the optimal linear discriminant using closed-form equations. The optimal discriminant is computed and veri ed for datasets of complete, complex SELDI spectra of human blood serum. Replicate experiments of different training/testing splits of each dataset are employed to verify robustness of the algorithm. The probabilistic classi cation method achieves excellent performance. We achieve sensitivity, speci city, and positive predictive values above 97% on three ovarian cancer datasets and one prostate cancer dataset. The Q5 method outperforms previous full-spectrum complex sample spectral classi cation techniques and can provide clues as to the molecular identities of differentially expressed proteins and peptides.
منابع مشابه
Proteomic Analysis of Gene Expression in Basal Cell Carcinoma
Background: Basal Cell Carcinoma (BCC) is a type of non-melanoma skin cancer. Alteration in gene expression is the important event that happens in cancer cell. Detection of this event is possible by proteomics techniques. Methods: Normal and tumor tissues were taken from BCC patient. Total proteins were purified by standard methods, and proteins were separated by two-dimensional electrophoresis...
متن کاملProbabilistic Disease Classification of Expression-Dependent Proteomic Data from Mass Spectrometry of Human Serum
We have developed an algorithm called Q5 for probabilistic classification of healthy versus disease whole serum samples using mass spectrometry. The algorithm employs principal components analysis (PCA) followed by linear discriminant analysis (LDA) on whole spectrum surface-enhanced laser desorption/ionization time of flight (SELDI-TOF) mass spectrometry (MS) data and is demonstrated on four r...
متن کاملA New Hybrid Feature Subset Selection Algorithm for the Analysis of Ovarian Cancer Data Using Laser Mass Spectrum
Introduction: Amajor problem in the treatment of cancer is the lack of an appropriate method for the early diagnosis of the disease. The chemical reaction within an organ may be reflected in the form of proteomic patterns in the serum, sputum, or urine. Laser mass spectrometry is a valuable tool for extracting the proteomic patterns from biological samples. A major challenge in extracting such ...
متن کاملQuantitative Proteomic Analysis of Serum Proteins from Oral Cancer Patients: Comparison of Two Analytical Methods
Serum proteomic analysis can be a valuable approach for the discovery of protein biomarkers for early detection or monitoring of a disease. In this study, two analytical methods were compared for quantification of serum proteins in patients with oral cancer. In the first approach, we quantified serum proteins between oral squamous cell carcinoma (OSCC) and healthy control subjects by performing...
متن کاملAlpha-1 antitrypsin, retinol binding protein and keratin 10 alterations in patients with psoriasis vulgaris, a proteomic approach
Objective(s):Psoriasis is an autoimmune disease that appears on the skin. Although psoriasis is clinically and histologically well characterized, its pathogenesis is unknown in detail. The aims of this study were to evaluate the proteome of psoriatic patients' sera and to compare them with those of normal healthy human to find valuable biomarkers. Materials and Methods: In a case-control study,...
متن کامل